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A METHOD AND SYSTEM FOR EFFICIENTLY EVALUATING AND 



DRAWING NURBS SURFACES FOR 3D GRAPHICS 



FIELD OF THE INVENTION 

The field of the present invention pertains to computer implemented 
graphics. More particularly, the present invention relates to a system and 
10 method for rendering parametric curved siirfaces in a three dimensional graphics 
environment. 

BACKGROLW) OF THE INVENTION 



15 Maiiy different areas of business, industry, government, education, 

entertainment, and most recently, the home, are tapping into the enomious and 
rapidly growing Hst of applications developed for today's increasingly powerful 
computer devices. Graphical user interfaces have replaced textual interfaces as 
the standard means for user computer interaction. 



Graphics have also become a key technology for commimicating ideas, 
data, and trends in most areas of commerce, science, and education. Until 
recently, real time user interaction with three dimensional (3D) models and 
pseudo-reaUstic images was feasible on only very high performance workstations. 



Computer graphics are being used today to perform a wide variety of tasks. 



20 
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These workstations contain dedicated, special piirpose graphics hardware. The 
progress of semiconductor fabrication technology has made it possible to do real 
time 3D animation, with color shaded images of complex objects, described by 
thousands of polygons, on powerful dedicated rendering subsystems. The most 
5 recent and most powerful workstations are capable of rendering completely life- 
like, reahstically hghted, 3D objects and structures. 

In a typical 3D computer generated object, the surfaces of the 3D object 
are described by data models. These data models store "primitives" (usually 

10 mathematically described polygons and polyhedra) that define the shape of the 
object, the object attributes, and the connectivity and positioning data describing 
how the objects fit together. The component polygons and polyhedra connect at 
common edges defined in terms of common vertices and enclosed volimies. The 
polygons are textured, Z-bufifered, and shaded onto an array of pixels, creating a 

15 reahstic 3D image. 

Many applications require the generation of smooth surfaces and smooth 
curves. To realistically generate a real-world image, the sxirfaces of objects in the 
image need to be realistically modeled. The most common representation for 3D 
20 STirfaces are "polygon meshes." A polygon mesh is a set of connected, polygonal 
boimded, planar surfaces. Open boxes, cabinets, and building exteriors can be 
easily and naturally represented by polygon meshes. Polygon meshes, however, 
are less easily used to represent objects with cxirved surfaces. 

25 Referring now to prior art Figure lA, a simple polygon mesh of a section 10 

of a patch is shown, while prior art Figure IB shows an actual section 11 of a 
patch. Each polygon of the section 10 is a mathematical representation of a 
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corresponding portion of the surface of the section 11. The interconnecting 
vertices (e.g., vertices 13) and the interconnecting edges of the polygons (e.g., 
edges 12) collectively define the surface of section 10 in 3D space. There are 
obvious short comings, however, in the accuracy of the section 10 in comparison 
5 to actual section 11. The polygons model the surface of section 11, but the 
representation is only approximate. The "errors" of the approximation can be 
made arbitrarily small by using more and more polygons to create an increasingly 
accurate piecewise linear approximation. 

10 Referring now to prior art Figure 2A, prior art Figure 2B, and prior art 

Figure 2C, an initial representation 21, an intermediate representation 22, and a 
final representation 23, are shown respectively. The initial representation 21 is a 
polygon mesh of a rain drop. As described above, initial representation 21 consists 
of a fewer number of polygons, leading to a "blocky," or geometrically aliased, 

15 representation of the rain drop. It should be noted that the polygons of initial 

representation 21 are comprised of triangles, whereas the polygons of section 10 
are comprised of quadrilaterals. Regardless of the nature of the polygon primitive 
(triangle, quadrilateral, and the like) used to model a curved surface, the general 
properties, e.g., geometric aliasing, are substantially the same. 

20, 

In proceeding fi-om initial representation 21 to intermediate representation 
. 22, and to final representation 23, the number of polygons in the polygon mesh 
modehng the rain drop are greatly increased. The final representation 23 contains 
several orders of magnitude more polygons than initial representation 21. This 
25 3delds a much more accurate piecewise linear approximation of the rain drop. 
Computer graphics engineers rely upon a techniques for modeling and 
manipulating the curved surfaces of the objects (e.g., the rain drop of final 
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representation 23). For engineering and computer aided design applications, it is 
often highly advantageous to have as accurate a model as possible. 

For example, engineers modeling the surface of the exterior body of an 
5 automobile, or modeUng a new mechanical part for a jet engine, desire models 
which are as accurate as possible. The models sho;ald enable accxirate 
visuaUzation of their designs. The models should also be easily manipulated and 
transformed, allowing engineers to alter a design, visualize the result of the 
alteration, and subsequently alter the design again. 

10 

While modeling objects with ciirved surfaces can be accomplished using 
polygon meshes having very large numbers of polygons, such models are 
computationally very unwieldy. If an object is stored as a large polygon mesh, the 
demands upon the data transfer bandwidth of the workstations, the demands 

15 upon the processor power, demands upon memory for storing such models, and 
other such problems, combine to make fast easy intuitive 3D interaction and 
iterative design alteration very slow (if not impossible). As a result, the 3D 
workstation industry utilizes mathematical parametric representations of the 
curved surfaces of an object to define and model the object. Parametric models 

20 also allow an artist or designer to easily manipulate the global shapes of surfaces 
through a small number of interactions. Polyhedral models are computationally 
very intensive, requiring many vertices to be repositioned for even a minor change 
in the overall shape of the modeled object. 

25 In a parametric representation, the curves of an object (e.g., the rain drop 

of Figure 2) are defined mathematically using equations. The equations 
collectively form what is referred to as a parametric representation of the object. 
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specifically, the curved surfaces of the object. Parametric representations 
overcome the problems posed by a polyhedral representation (e.g., a polygon 
mesh). The actual shape of the object can be very closely approximated using a 
parametric representation. This representation consumes several orders of 
5 magnitude less space than an equivalent polyhedral representation 

Thus, for example, the rain drop of Figiu-e 2C can be defined and stored as a 
parametric representation. This representation can be easily manipulated, easily 
altered, and easily transformed, among other advantages, in comparison to the 
10 polyhedral representation (e.g., final representation 23). 

Perhaps the most vddely used form of curved surface parametric 
representation are non-uniform rational B-spline representations (NURBS). 
NURBS are widely used in the computer aided design (CAD) indiistry to model 

15 curved surfaces. Among their many benefits, NURBS have two important 
advantages relative to other forms of parametric representation. The first 
advantage is that they are invariant under projective transformations. The 
second advantage is that they can very accurately define virtually any curved 
surface. Thus, NURBS are widely used in the field of 3D CAD. While parametric 

20 representations of curved surfaces and NURBS are discussed herein, the 
mathematical form and characteristics of curved siuface parametric 
representation and NURBS are well known in the art. Those desiring a more 
extensive description of curved surface parametric representation and NURBS 
should consult "Gerald Farin, CURVES AND SURFACES FOR COMPUTER 

25 AIDED DESIGN, ISBN 0-12-249051-7*' which is incorporated herein as 
background material. 
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The problem with NURBS, however, is the fact that the NURBS model is 
not "natively" rendered by the dedicated rendering hardware of prior art 3D 
graphics workstations. The rendering hardware of typical 3D graphics 
workstations are designed and optimized to function with polygons. The rendering 
5 algorithms, the special pxirpose rendering hardware, and the display hardware are 
designed to manipulate and display images generated from polygonal graphics 
primitives. Thus, the NURBS models need to be transformed using software into 
polygon meshes prior to rendering. This software executes on the host 
processor(s) or graphics co-processor(s), operates on the NURBS model, and 

10 creates a resulting polygon mesh stored in the memory of the 3D graphics 
workstation. The resulting polygon mesh is then rendered by the dedicated 
rendering hardware into a 3D image on a display. When an engineer makes a 
change to the NURBS model, the parameters of the NURBS model are changed, a 
new resTilting polygon mesh is created, and the changed image is subsequently 

15 displayed. 

This prior art rendering process, however, is slow. The specific rendering 
software which transforms the NURBS model into the resulting polygon mesh is 
computationally very expensive. Large nimibers of clock cycles are consumed by 
20 the specific rendering software. This greatly detracts fi-om the desired 

"interactive" quahty of modem CAD. Additionally, other programs executing 
concurrently on the workstation are adversely impacted. 

The prior art rendering process makes very large demands on the data 
25 transfer bandwidth of the 3D graphics workstation. The specific rendering 

software executes on the host processor(s) or graphics co-processor(s) and needs 
to move large amounts of data across the busses of the 3D graphics workstation 
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(e.g., transfers to and from main memory or from graphics co-processors to 
polygon rasterizer(s)). This also adversely impacts other programs and processes 
running on the 3D graphics workstation. 

5 In addition, the prior art rendering process does not utilize the dedicated 

rendering hardware of the 3D graphics workstation. Modem 3D graphics 
workstations often include very sophisticated and very extensive rendering 
subsystems. These subsystems are piirposely designed to accelerate the 3D 
graphics rendering process. However, the specific rendering software, as described 
10 above, executes on the host processor(s) or graphics co-processor(s) and does not 
make use of the dedicated rendering hardware until the resulting polygon mesh is 
completely computed. 

Thus, what is needed is a method which radically speeds up the 3D graphics 
15 rendering process. The method should greatly speed the process of rendering 

NURBS models. What is further required is a method which accurately renders 
NURBS models and does not undtdy burden the data transfer bandwidth of the 3D 
graphics workstation, or consume an inordinate amount of host processor or 
graphics co-processor clock cycles. The method of the present invention provides 
20 a novel solution to the above requirements. 
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SUMMARY OF THF. TNVENTION 

The present invention provides a method and a system which greatly 
increases the speed of the graphics rendering process. The method and system of 
the present invention increases the speed and efficiency of the process of 
5 rendering NURBS models. The method and system of the present invention 
accurately renders NURBS models without burdening the data transfer 
bandwidth of the computer system. In addition, the method and system of the 
present invention does not consimie an inordinate amoxint of host processor or 
graphics co-processor clock cycles. In so doing, the present invention provides a 
10 fast and efficient process for rendering NURBS models. 

In one embodiment, the present invention comprises a computer 
implemented process and system for rendering curved lines and surfaces as 3D 
graphics on a display. The system of the present invention includes a computer 

15 system having a processor, a bus, and a 3D graphics rendering pipeline. The 
cvirves and surfaces are modeled by non-imiform rational B-spUnes (NURBS). 
The process of the present invention functions by receiving a NURBS model for 
rendering from a software program running on the host processor. The NURBS 
model defines a curve or surface. The process of the present invention efficiently 

2 0 converts the NURBS model to a Bezier model using the hardware of the graphics 
rendering pipeline. The Bezier model describes the same curve or surface. The 
process of the present invention subsequently generates a plurality of points on 
the curve or surface using the Bezier model and the graphics rendering pipeline. 
The points on the surface are then used by the graphics rendering pipeUne to 

25 render the curve or surface defined by the Bezier model. In so doing, the present 
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invention provides a fast and efficient process for rendering NURBS models, 
hence, greatly increasing the speed of the graphics rendering process. 

In another embodiment, the present invention renders a curve or surface 
5 directly from a NURBS model defining the curve or surface. In this embodiment, 
the present invention uses the graphics rendering pipeline to directly evaluate the 
NURBS control points of the NURBS model into points on a ciorve or siirface. 
The points on the curve or surface are then used by the graphics rendering 
pipeline to render the curve or surface, in the manner described above. 

10 

Additionally, the present invention includes a method for automatically 
generating normals (e.g., vectors normal to the surface) for a surface defined by a 
NURBS model. After points on the surface have been generated, in the manner 
described above, the present invention uses the graphics rendering pipeline to 
15 generate normals for the surface. The normals are subsequently used to render 
reaUstic lighting and shading effects onto the surface. 
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RRTF.F DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example and not by way of 
limitation, in the figures of the accompanying drawings and in which like reference 
numerals refer to similar elements and in which: 

5 

Prior Art Figure lA shows a simple polygon mesh of a section of a patch of 
the prior art. 

Prior Art Figure IB shows an actual section of the patch of the prior art. 

10 

Prior Art Figure 2A shows an initial representation polygon mesh of the 
prior art. 

Prior Art Figure 2B shows an intermediate representation polygon mesh of 
15 the prior art. 

Prior Art Figure 2C shows a final representation polygon mesh of the prior 

art. 

20 . Figure 3 shows a graphics rendering system in accordance with one 

embodiment of the present invention. 

Figure 4 shows a block diagram of a geometry subsystem and a raster 
subsystem in accordance with one embodiment of the present invention, 

25 
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Figure 5 shows a block diagram of a tri-linear interpolator in accordance 
with one embodiment of the present invention. 

Figure 6 shows a geometric spatial representation of four control points of a 
5 Bezier curve. 

Figure 7A shows the four control points from Figure 6 plugged into the 
inputs of the tri-linear interpolator from Figure 5 in accordance with one 
embodiment of the present invention. 

Figure 7B shows a geometric spatial representation of the foxir Bezier 
control points, their respective intermediate control points, and the resulting point 
on the cxirve in accordance with one embodiment of the present invention. 

Figure 7C shows a curve defined by the Bezier control points from Figure 

7B. 

Figure 8A shows the tri-linear interpolator from Figure 5 with Bezier control 
pouit inputs in blossom notation in accordance with one embodiment of the 
present invention. 

Figure 8B, shows a geometric spatial representation of the four Bezier 
control points and intermediate control points, in blossom notation, in accordance 
with one embodiment of the present invention. 
25 

Figure 9A shows the tri-linear interpolator from Figure 5 having NURBS 
control point inputs in accordance with one embodiment of the present invention. 
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Figure 9B shows a geometric spatial representation of a NURBS to Bezier 
conversion in accordance with one embodiment of the present invention. 

5 Figure 10 shows a table illustrating the relationship between the LERP 

weights of a tri-Unear interpolator and the resulting Bezier control points in 
accordance with one embodiment of the present invention. 

Figxire llA shows a plurahty of look up tables used in an alternate 
10 embodiment of the present invention. 

Figure IIB shows a tri-linear filter which implements the direct NURBS 
evaluation. 

15 Figure 12A shows a diagram of the process of automatic normal generation 

in accordance with the present invention. 

Figure 12B shows a diagram of an optimized process of automatic normal 
generation. 

20 

Figure 13A shows a tri-linear interpolator having inputs implementing 
surface partial evaluation in accordance with the present invention. 

Figure 13B shows a spatial representation of the surface partial evaluation 
25 process. 
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Figxire 14 shows a flowchart the steps of a method in accordance with one 
embodiment of the present invention. 

Figure 15 shows a computer system upon which the present invention may 
5 be practiced. 
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DETAILED DESCRTPTION OF THE INVENTION 

In the following detailed description of the present invention, a method and 
system for efficiently evaluating and drawing NURBS sxirfaces for 3D graphics, 
nvimerous specific details are set forth in order to provide a thorough understanding of 
5 the present invention. However, it will be obvious to one skilled in the art that the 
present invention may be practiced without these specific details. In other instances 
well known methods, procedures, components, and circmts have not been described in 
detail as not to imnecessarily obscure aspects of the present invention. 

NOTATION AND NOMENCLATURE 

Some portions of the detailed descriptions which follow are presented in terms of 
procedures, logic blocks, processing, and other symbolic representations of operations 
on data bits within a computer memory. These descriptions and representations are 
the means used by those skilled in the data processing arts to most effectively convey 
the substance of their work to others skilled in the art. A procedure, logic block, 
process, step, etc., is here, and generally, conceived to be a self-consistent sequence of 
steps or instructions leading to a desired result. The steps are those requiring 
physical manipulations of phj'sical quantities. Usually, though not necessarily, these 
quantities take the form of electrical or magnetic signals capable of being stored, 
transferred, combined, compared, and otherwise manipulated in a computer system, 
it has proven convenient at times, principally for reasons of common usage, to refer to 
these signals as bits, values, elements, s3anbols, characters, terms, numbers, or the 
like. 

25 It should be borne in mind, however, that all of these and similar terms are to be 

associated with the appropriate physical quantities and are merely convenient labels 
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applied to these quantities. Unless specifically stated otherwise as apparent firom the 
following disciissions, it is appreciated that throughout the present invention, 
discussions utilizing terms such as "rendering" or "processing" or "computing" or 
"converting" or "generating" or "evaluating" or "drawing" or the like, refer to the action 
5 and processes of a computer system (e.g., computer system 1500 of Figure 15), or 
similar electronic computing device, that manipulates and transforms data 
represented as physical (electronic) quantities within the computer system's registers 
and memories into other data similarly represented as physical quantities within the 
computer system memories or registers, or other such information storage, 
10 transmission, or display devices. 

t £ 

K In one embodiment, the present invention comprises a computer 

f implemented process and system for rendering curved surfaces as 3D graphics on 

m a display. The system of the present invention includes a computer system 

tfl 15 having a processor, a bus, and a 3D graphics rendering pipeline. The surfaces are 
b modeled by non-uniform rational B-splines (NURBS). NURBS are well known and 

ry are widely used as mathematical parametric models of 3D objects. The process of 

the present invention functions by receiving a NURBS model for rendering from a 
software program ininning on the host processor. The NURBS m.odel defines a 
20 surface. The process of the present invention efficiently converts the NURBS 
model to a Bezier model using the hardware of the graphics rendering pipehne. 
Bezier models are similarly well known, and are also used as parametric models of 
3D objects. The Bezier model describes the same surface. The process of the 
present invention subsequently generates a pluraHty of points on the curve or 
25 sxuface using the Bezier model and the graphics rendering pipeline. The points on 
the curve or surface are then xised by the graphics rendering pipeline to render the 
curve or surface defined by the Bezier model. 
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In this manner, the present invention provides a method and a system 
which greatly increases the speed of the graphics rendering process. The method 
and system of the present invention increases the speed and efficiency of the 
5 process of rendering NURBS models. The method and system of the present 

invention accurately renders NURBS models without burdening the data transfer 
bandwidth of the computer system. In addition, the method and system of the 
present invention does not consume an inordinate amount of host processor or 
graphics co-processor clock cycles. In so doing, the present invention provides a 
10 fast and efficient process for rendering NURBS models. The present invention 
and its benefits are described in detail below. 

Figure 3 shows a graphics rendering system in accordance with the present 
invention. Graphics rendering system 300 includes a graphics rendering pipeline 

15 301 coupled to a central processing unit (CPU) subsystem 302. The graphics 
rendering pipeline is comprised of dedicated speciaHzed graphics rendering 
hardware and is designed to greatly accelerate the process of rendering 3D 
graphics. The graphics rendering pipeline 301 includes a geometry subsystem 
303, a raster subsystem 304, and a display subsystem 305, each coupled in 

20 pipeline fashion to CPU system 302. 

As is well known in the art, the process of creating realistic images involves 
a number of stages arranged in the pipeline fashion as shown in graphics pipeline 
301. The process of creating images fi-om models is referred to as rendering, and is 
25 described below. Those desiring a more detailed discussion of the rendering process 
and a graphics rendering pipeline should consult U.S. Patent Nvimber 5,230,039 of 
Mark S. Grossman, Kurt B. Akeley, and Robert A. Drebin, which is incorporated 
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herein as background material. Each subsystem (e.g., geometry subsystem 303) 
implements a stage of the rendering process. The subsystems of graphics pipeUne 
301 include dedicated special purpose graphics rendering logic implemented in 
hardware to greatly speed the rendering process. 

5 

With reference still to Figure 3, the CPU subsystem 302 nms the 
application program which creates the models and responds to and interacts with 
the user. The models are stored as database structures maintained in main 
memory. The CPU subsystem 302, running the application program, traverses 
10 the model database and sends high level graphics data to the graphics rendering 
pipeline 301 for rendering. 

The geometry subsystem 303 transforms and clips the graphics data from 
the models into screen coordinates. The geometry subsystem includes a plurality 
15 of floating point processors organized to perform geometric transformation 

calculations in parallel. Each floating point processor is referred to as a geometry 
engine. The geometry subsystem 303 further includes an input FIFO buffer to 
receive the graphics data from the CPU subsystem 302. 

20 The raster subsystem 304 breaks points, lines and polygon data (e.g., 

graphics primitives) into pixels. This process is referred to as rasterization. The 
raster subsystem 304 performs the calculations required to reduces the graphics 
primitives received from the geometry subsystem to individual pixels. Each pixel 
is assigned an x, y screen position, plus a number of parameters per pixel, 

25 including z, red blue, alpha, and texture parameters s and t. These parameters 
are linearly interpolated between the vertices and the edges of the primitives (e.g., 
polygons). 
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The raster subsystem also computes visibility and writes pixel data to a 
frame buffer. The raster subsystem 304 takes pixel data generated by raster 
subsystem 304 and selectively updates the image and z bit planes of the frame 
5 buffer, using the results of z comparisons and alpha blending. The raster 
subsystem also performs antialiasing. Additionally, raster subsystem 304 
includes a memory for storing "texels", or texture map elements. Each texel is 
composed of r, g, b, and alpha fields. Based upon the s and t parameters, raster 
subsystem 304 maps a textured pixel color onto each pixel stored in the frame 
10 buffer. 

The display subsystem displays the contents of the frame buffer on a 
computer display monitor. The display subsystem receives the pixel information 
from the frame buffer, routes it through digital to analog converters, and displays 
15 the resulting analog signal on a color monitor. The color monitor displays each 

pixel of the rendered image in accordance with the data describing each respective 
pixel and in the specified image format (e.g., 1600x1200 pixels). 

With reference now to Figure 4, a block diagram of the geometry subsystem 
20 303, the raster subsystem 304 in accordance with present invention is shown. 
The geometry subsystem 303 includes a FIFO buffer 306 coupled to receive 
graphics data via bus 313. As described above, geometry subsystem 303 
performs geometry transformations, as represented by transformer unit 307. 
The raster subsystem 304 includes logic which rasterizes the primitives received 
25 from geometry subsystem 303 and performs the texture mapping computations. 
This logic is implemented in rasterizer unit 308 and textiare mapping \mit 309. 
Texture memory 310 stores the individual texture map elements (i.e., texels) and 
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provides them to texture map unit 309. The raster subsystem 304 includes 
lighting and shading xonit 311 coupled to receive the pixel data from the texture 
map unit 309. After filtering and blending operations for lighting, shading, and 
antialiasing, the pixel data is stored in a frame buffer 312 and subsequently 
5 output to display subsystem 305 via bus 314. 

The method of the present invention fiinctions by using the standard 
graphics pipeline hardware to render NUEBS surfaces. The process of the 
present invention utilizes the textxu-e mapping xuiit 309 to directly render NURBS 
10 models. In one embodiment, the process of the present invention efficiently 

converts the NURBS model to a Bezier model using the hardware of the graphics 
rendering pipeHne. The Bezier model describes the same cur\''ed stirface. The 
process of the present invention subsequently generates a plurality of points on 
the surface using the Bezier model and the graphics rendering pipeline. 

15 

As described above, prior art methods do not natively render NURBS 
models with the dedicated rendering hardware (e.g., graphics pipeline 3G1). The 
NURBS models are transformed into polygon meshes by software executing on 
the host processor (e.g., CPU subsystem 302). Tlie polygon mesh is stored in a 
20 data base structure in main memory and traversed by the CPU subsystem 302 in 
accordance with the software. The resulting high level graphics data stream is 
- transmitted to FIFO 306 and the image is rendered "conventionally" in the 
manner described above. 

25 The present invention, however, utilizes the graphics rendering pipeline 301 

to implement a method of natively rendering NURBS models. The raster imit 
308, texture mapping unit 309, and texture memory 310 (each included within 
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raster subsystem 304) are designed and optimized for processing conventional 
graphics primitives and texel data. The present invention utilizes this existing 
hardware to natively render NURBS models without first converting them to 
polygon meshes in CPU subsystem 303. 

5 

With reference now to Figure 5, a block diagram of a tri-linear interpolator 
500 is shown. Tri-linear interpolators (e.g., tri-linear interpolator 500) are utilized 
extensively within texture mapping unit 309. As is well known in the art, tri-linear 
interpolator 500 includes seven linear interpolators 501 through 507. Linear 

10 interpolators 501, 502, and 503 form a bilinear interpolator. Two bi-linear 

interpolators combined with a common interpolator (e.g., Unear interpolator 507) 
form a tri-linear interpolator. Bilinear interpolation is used to interpolate texels in 
a 2D image. For example, Unear interpolator 501 interpolates between input iO 
and input il. An interpolation ratio is determined by a "weight" . An interpolation 

15 weight closer to 0.0 results in an interpolation output closer to iO, while and 

interpolation weight closer to 1.0 results in an interpolation output closer to il. 
Each linear interpolator is typically referred to as a "LERP", each interpolation 
weight is referred to a:', a "LERP weight", and hence, tri-linear interpolator 500 is 
refeiT3d to an a "LERP tree". Quantitatively, the function of each linear 

20 interpolator can be expressed as: t * il + (1-t) * iO, where t equals the LERP 
weight. 

As is well known in the art, bilinear interpolation is conventionally used to 
interpolate between four texels to produce a blended color for a fi-agment. Four 
25 texels, which surround the s and t textxare coordinates of interest, are used to 

generate the texture color of each firagment. Two pairs of texels, arranged in a 2x2 
array, are each pairwise interpolated horizontally to generate two new 
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interpolates. These interpolates are then interpolated vertically to generate the 
final filtered texture color (using interpolator 505). 

Tri-linear interpolation is used to interpolate between texels at different 
5 scales (i.e., texel screen projection size). Thus, as is well known in the art, tri- 
linear interpolation interpolates between the output of two bilinear interpolators. 
Hence, the output of interpolator 505 and the output of interpolator 506 is 
interpolated (by interpolator 507) to obtain a filtered color for the fi-agment. Thus, 
tri-linear interpolator 500 is often referred to as a filter. Tri-linear interpolators 
10 (e.g. , tri-linear interpolator 500) are utilized extensively in the hardware of texture 
mapping unit 309. 

The process of the present invention, however, uses tri-linear interpolator 
500 to process NURBS or Bezier control points, instead of texels. First, in the 

15 case of a Bezier curve, the process of the present invention uses tri-linear 

interpolator 500 to generate points on the surface defined by the Bezier model. 
The Bezier control points define a surface. Once the points on the surface are 
generated, they are feed back into FIFO 306 as graphics primitives and 
subsequently rendered into the surface defined by the Bezier model. The process 

20 of rendering the points on the surface into an image, occurs in the graphics 
rendering pipeline, in the manner described above. Thus, the process of the 
present invention renders the Bezier model into a surface without using the CPU 
subsystem (e.g., CPU subsystem 302) or data transfer bandwidth between main 
memory and the CPU subsystem or graphics co-processor. 

25 

Referring now to Figure 6, a spatial representation of four control points of 
a Bezier curve is shown. The shape of a Bezier curve is defined by the control 
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points bO, bl, b2, and b3. The control points bO, bl, b2, and b3 are shown in what 
is referred to in the art as a "de Casteljau form", thiis, the spatial relationship 
between the foxir control points and the lines connecting them as shown. The 
basic recursive de Casteljau algorithm is stated as: 

5 

Equation 1 

hf = bf * (u+1) + hl'^: - ui 

where bf = bi 

10 The term u is a parametric value and i and k are both indices of the recursion. 

The process of the present invention utilizes tri-linear interpolator 500 to 
evaluate the Bezier curve defined by the control points bO, bl, b2, and b3. The 
process of the present invention implements the de Casteljau algorithm in tri- 
15 hnear interpolator 500. 

Referring now to Figure 7A, the foiu: control points bO, bl, b2, and bS are 
shown plugged into the inputs of tri-linear interpolator 500. Each of interpolators 
501-504 produces a respective intermediate control point as shown (e.g., resulting 

20 point blO shown beneath interpolator 501). The intermediate control points are 
subsequently interpolated vising interpolators 505-507 to obtain a resulting point 
on the curve (e.g., bSO from interpolator 507). In this manner, the linear 
interpolators of tri-linear interpolator 500 are used to generate a series of points 
on the curve. Each "pass" through tri-linear interpolator 500 yields one point one 

25 the curve. In accordance with equation 1, by vaxrying the LERP weight used by all 
of tri-linear interpolators 501-507, the output of interpolator 507 yields varying 
points on the curve. The LERP weights numerically range fi-om 0.0 to 1.0. This 
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process of generating points on a curve defined by a cubic polynomial equation 
(e.g., a Bezier curve) in this form is the de Castlejau algorithm, and its execution is 
referred to as evaluation. 

Figure 7B shows the spatial relationship of the four Bezier control points 
bO, bl, b2, and b3 along with their respective intermediate control points and the 
resulting point on the curve, b30. As described above, each pass through tri-linear 
interpolator 500 produces a point on the curve. The location of this point is 
determined by the LERP weights used in each of interpolators 501-507. For 
example, Figure 7B shows the resiilting point on the ciirve (e.g., b30) in the case 
where the LERP weights are equal to 0.5. It should be noted that a LERP weight 
at one half the ntmierical range (e.g., 0.5) yields a point on the curve located at the 
mid point of the curve. Thus, a pass through tri-linear interpolator 500 vising a 
LERP weight of 0.25 3delds a resulting point on the curve located one quarter of 
the way along the length of the curve. Li this manner, successive passes are 
made through tri-linear interpolator 500 using differing LERP weights (e.g., 0.0, 
0.25, 0.5, 0.75, and the like) yielding a series of points on the curve, and ultimately, 
a series of points on the surface. 

Figure 7C shows a curve 704 defined by Bezier controj points bO, bl, b2, 
and bS. As described above, the resiilting point bSO on the curve 704 is located at 
the midpoint of the cxarve. As successive passes are made through tri-linear 
interpolator 500 with differing LERP weights, differing points on cvxve 704 are 
defined. Using these resulting points, curve 704 is rendered by the graphics 
rendering pipeline 301 (shown in Figure 3). The resulting points are feed back into 
FIFO 306 as graphics primitives and subsequently rendered into the curve defined 
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by the Bezier model. The rendering process subsequently proceeds in the graphics 
rendering pipeline, in the manner described above. 

In this manner, the process of the present invention evaluates and renders 
5 Bezier curves. Bezier curves are directly evaluated in raster subsystem 304. By 
using the dedicated tri-Unear interpolators of texture mapping imit 309 (shown in 
Figure 4), the process of the present invention implements in fast hardware what 
was conventionally implemented in software on either the CPU subsystem or 
graphics co-processor, thus greatly accelerating the rendering process. 

10 

With reference now to Figure 8A and Figure 8B, the tri-linear inteiT3olator 
500, and four Bezier control points and intermediate control points, are 
respectively shown. The four Bezier control points and intermediate control points 
are shown using blossom notation. Blossom notation comprises another form of 
15 representing the de Casteljau algorithm and is well known in the art. The terms of 
the blossom notation represent the repeated Hnear interpolations carried out by 
tri-linear interpolator 500. Particularly, the blossom form is defined as follows: 

Equation 2 
20 abu = abc * (1-u) + abd * u 

where a, b, c, and d are control point values and u is an arbitrary parametric 
variable. Thus, Figures 8A and SB are the same as Figures 7A and 7B except for 
the fact that they are illustrated using the blossom form. 

25 

Accordingly, in order to evaluate and render NURBS models, the NURBS 
models first need to be translated into Bezier models. Once the NURBS model is 
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translated into a Bezier model, the process of the present invention evaluates and 
renders the Be2der model in the manner described above. The blossom form 
provides an efficient way of describing NURBS ciorves. The present invention 
translates NURBS models into Bezier models to facilitate fast and efficient 
5 evaluation. Bezier models are inherently well suited to relatively fast evaluation 
iising tri-linear interpolators. However, NURBS models represent useful shapes 
more efficiently than Bezier models. Hence, NURBS models are more widely used 
in the CAD field. 

Figure 9A shows the tti-linear interpolator 500 and Figure 9B shows a 
NURBS to Bezier conversion. Control points 012, 123, 234, and 345 comprise 
NURBS control points for a NURBS model. These NURBS control points are fed 
into tri-linear interpolator 500 in the manner shown in Figure 9A. Each pass 
through tri-linear interpolator 500 yields a Bezier control point at the output bk- 
Thus, four passes are required to yield Bezier control points 222, 223, 233, and 
333. Generally, the four Bezier control points define a curve equivalent to that 
defined by the four NURBS control points 012, 123, 234, 345. In a manner 
similar to the Bezier evaluation described above, the LERP weights of each of the 
interpolators are varied to obtain the desired Bezier control point. The LERP 
weights are varied in accordance with a table shown in Figure 10. 

Figure 10 shows a table (e.g., table 1) illtostrating the relationship between 
the LERP weights of tri-linear interpolator 500 and the resulting Bezier control 
points. Thus, in accordance with table 1, to obtain Bezier control point 222, the 
25 LERP weights for the particular interpolators are as shown in the 222 column, to 
obtain Bezier control point 223, the LERP weights for the particular interpolators 
are determined via the terms shown in the 223 colimin, and so on. In this manner, 
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the process of the present invention determines Bezier control points 222, 223, 
233, and 333 from the NURBS control points 012, 123, 234, and 345. In the 
present embodiment, table 1 is computed on the CPU subsystem (e.g., CPU 
subsystem 302) ahead of time in double precision. 

5 

It shoxild be appreciated that the equations shown in table 1 result from the 
geometric relationship between the control points of NURBS curves and the 
control points of Bezier ctirves. The equations shown in table 1 can be derived 
directly form the symmetry of the blossom function. For example, in the cubic 
10 case, going from B-spline control points A, B, C, D, and knot values a, b, c, d, e, and 
f, Bezier control points A', B*, C, and D* are derived using the following equations: 

B' = ((c-b)/(e-b)) * (C-B) + B 
C' = ((d-b)/(e-b))*(C-B) + B 
15 X = ((c-a)/(d-a)) * (B-A) + A 
Y = ((d-c)/(f-c)) * (D-C) + C 
A' = ((c-b)/(d-b))*(B*-X) + X 
- ((d-c)/(e-c)) * (Y-C) + C 

20 where X and Y are temporaiy variables. 

The above equations implemented in a tri-linear interpolator 500 as shown 
in Figure 8A and Table 1 of Figure 10. Thus, in a hardware implementation, there 
are four separate LERP weights and four control points received from two 
25 "textures". Thus, the control points would be the NURBS control points taken 

four at a time from a control "mesh", and the four LERP weights would come from 
another texture or lookup table generated from the knot sequence. 
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Before the conversion from NURBS to Bezier, the NURBS cxirve needs to 
be transformed from the global to the local domain. The present invention 
implements this function by transforming from the global NURBS domain to the 
local Bezier domain. The global to local transformation is required due to the fact 
that, in the present embodiment, the NURBS curve and the Bezier curves 
resulting from the translation process are both defined over the interval [0, 1]. As 
is well known in the art, a NURBS model includes "knot vectors" which, in addition 
to the NURBS control points, define the curve. The transformation from the 
global to the local domain is governed by the equation: 

Equation 3A 

u - c^ 

^ = T:e 

where u equals the b-spHne local parametric value, ti* equals the NURBS global 
parametric value, and^* and ^ are both equal to a knot value from the NURBS 
curve. The present invention solves these terms and subsequently uses them 
with tri-linear interpolator 500 to convert NURBS models into Bezier models. 

The other component of the global to local transform is the association 
between b-sphne domain integral and the parameter value, u. As is well known in 
the art, for each value of u, there is a "block index", b, which identifies the location 
of the set of NURBS (and Bezier) control points which affect the surface shape at 
that parameter value, u. 

Equation 3B 

b<u<b+l 
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This equation can be encoded in a texture map or lookup table, in a fashion very 
similar to the encoding of the global to local parameter transformation described 
above. 

5 

Thus, after the NURBS cxirve is converted to a Bezier curve, the present 
invention evaluates and renders the Bezier ctirve in the manner described above. 
It should be appreciated that the conversion process takes place within the 
graphics pipeHne hardware. As such, the present invention does not place undue 
10 data transfer bandwidth loads on the computer system. 

It should be appreciated that each b spline domain interval defines an 
independent Bezier curve. To be more precise, for each B-spline domain interval 
with a sufficient number of knots to either side, the B-spline domain interval 
15 defines an independent Bezier curve. B-spline domain segments near the 
beginning and the end of a B-spline control point Ust do not have a sufficient 
number of knots or control points defined to determine a Bezier cur'/e. A cubic B- 
spiine curve with n vertices generates up to n-3 Bezier cxrrves, generating 4(n-3) 
Bezier vertices. 

20 

Now consider the case of a Bezier sxirface patch. Points are evaluated on a 
surface in two stages. The Bezier patch consists of a 4x4 array of points. The 
present invention needs to evaluate at point "P" with coordinates of (s, t). In the 
first stage, the four rows of the patch are considered as Bezier ciirves, and 
25 evaluated at the s coordinate of each of these curves. In the second stage, the 
resulting four points firom the first stage are treated as four points of a single 
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Bezier curve. The present invention evaluates this cxirve at coordinate t, thereby 
obtaining the desired point P. 

With reference now to Figure llA and Figure llB, a process in accordance 
5 with an alternative embodiment of the present invention is shown. In this 

alternate embodiment, the NURBS model is directly rendered into a surface, as 
opposed to first being converted into a Bezier model and subsequently being 
rendered as described above. This embodiment implements a "Mansfield 
algorithm". 

10 

Figure llA shows a plurality of look up tables 1150. The look up tables 
1150 are indexed with texture parameters s, and t. These parameters are 
mapped by look up tables 1150 into a series of LERP weights and a plurality of 
NURBS control points implementing a global to local transformation. 

15 

Figure HE shows a tri-linear filter 500 which implements the direct 
NURBS e'/aluation. Tri-Hnear filter 500 includes two rows of inputs. The first 
rov^ of inputs is comprised of NURBS control points A, B, C, D, E, F, G, and H. 
The second row of inputs is comprised of NURBS control points I, J, K, L, M, N, 0. 

20 and P. Each of the linear interpolators 501-507, as described above include a 

LERP weight parameter. The LERP weight parameters are shown as al, a2, a3, 
a4, a5, a6, and a7, respectively. In this embodiment, the output of interpolator 
507 is coupled to a register and to one input of an interpolator 1160. The register 
is also coupled to interpolator 1160 via its second input. Interpolator 1160 uses 

25 LERP weight parameter a8. The output of interpolator 1160 is bk, a point on the 
surface. When used as described above, the tri-linear interpolator 500 performs 
what is referred to as a "separable convolution". 
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The process of the present embodiment evaluates the surface s(u,v) by 
implementing the following equation: 



Equation 3C 

S(u,v) = IlNf (u)Nf (v) dij 

i j 

Np^u) N[;l(u) 

for U, Ni (u) = (U - ui) — + (Ui+r+l - u) . ^ ^ 



where Nf = 



1 e[UiUi^.i] 

0 ^^U[uiUi^i] 



Note that there is an equivalent expression for v. 
10 Additionally, note that dij are the local NURBS control points for the 

parameter value (u,r). The terms u and v are parameters of the surface, i and j 
are indices of the control points, and r is an index of the recursion. 



By implementing equation 3C, the process of the present embodiment is able to 
15 directly evaluate a NURBS model into a surface. This process does not require 
the prior conversion of the NURBS model into a Bezier model. In order co 
implement equation 3C, the LERP weights (al-a8) are loaded as described below. 

In the general case, (e.g., ID) a convolution equation is expressed as: 

20 

Equation 3D 

m 

C(x)= XK(h)I(x-h) 
h=0 



Where C denotes the convolved result at point x, I is the input image, and K being 
25 the m convolution kernel coefficients. Given equation 3D, the present invention 
derives the alpha values (i.e., LERP weights) used by the LERPS in the tri-linear 
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interpolator 500 based on the kernel values. The present invention extrapolate 
equivalencies between the original kernel coefficients based on going through the 
tri-linear interpolator 500: 

The original ID convolve is expressed as: 

koio+kiii +k2i2+k3i3+k4i4+k5i5+k6i6+k7i7 

Using tri-linear interpolator 500, the expression is: 

Equation 3E 

(1-a) /'i Z>Z2+ il-b)i^ ci^-{- (1 -c)/^ di^-h (1 -d)ij 

e (l-e) ^ f (1-y) 

>■ ill —— / ^ " ' " ■ ' 1^ II ■ ■ II ■ ■ — ■ ■ ■!> 

g (1-g) 

' "' " V ~ ■ I - I - . I. 

h (scale) 

where a though g equal LERP weights and h equals the scale, and ix are the inputs 
of the input image- For the case of convolutions, the last LERP h fusing this 
notation) is assigned h= 1.0. 

The last step for convolutions is to scale the output to de-normahze the 
result. We can form equivalencies between the K's and the alphas by algebraic 
manipulation as follows: 

Equations 3F 
k() = a*e*g*h 
ki = (l-a)*e*g*h 
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k2 = b.(l-e)*g*h 
k3 = (l-b)*(l-e)*g*h 
k4 = c*f*(l-g)*h 
k5 = (l-c)*f*(l-g)*h 
5 ke = d*(l-f)*(l-g)*h 
k7 = (l-d)*(l-f)*(l-g)*h 

Equations 3F also take into accovmt that the LERP alpha values are 
normalized. Additionally, before alpha values for each LERP are extrapolated, the 

10 present invention also needs to accotint for possible divide by zeros. These could 
be caused by either very sparse kernel matrices or coefficients that may cancel 
each other out (as is the case for some edge detectors). To compensate for these 
effects, the present invention works with the absolute values of each kernel 
coefficient and compensates by negating one of the inputs when the coefficient is 

15 negative. Given this, the LERP alphas (i.e., LERP weights) are derived as: 

Equations 3G 

|ko| 
|ko| + |ki| 

. |k2| 
20 b = i — — — r 

k2 + ka 



k4 



c = 



d = 



^41 + Iksl 
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20 



e = 



|kok|ki| 
|ko| + |ki| + |k2| + |k3| 



|k4| + |k5 



f= 



k4| +|k5| + |k6| + |k7| 
|k4| +|k5| 



S = ] 



k4| +|k5| + |k6| + |k7| 

|ko| +|ki| +|k2| + [ksl 

ko| + |ki| + |k2| + Iksl + |k4| + [ksl + [kel + Ik?] 



10 h = |ko| +|ki| +|k2| + Iksl +|k4| + Iksl +|k6| + |k7| 



** 



where a, b, c, d, e, f, and g, are LERP weights. The last LERP, h, can be used to 
accumulate the current result with the previous pass. However, because the 
present invention is interpolating as opposed to simple acctunulating, the present 
15 invention needs to account for the contributions of previous passes (i.e., doing an 
internal multi-pass). Thus, the h for pass j would really be represented as: 



Equation 3H 

X|kj| 



h^ = 



fl(l-hi) 



1=J 



Where j is the current pass, hi is the alpha value for pass i, n is the total nvimber 
of passes, and the sum of ] Kj | is the magnitude (or sxmi) of all of the kernel 
coefficients corresponding to pass j. One alternative is to bypass the last LERP, 
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and use the accxunulation logic provided in the "pixel pipe" of the graphics pipeline. 
This ensures better precision and leaves the h alpha value as an expression 
dependent solely on the coefficients of the current pass. Because these values are 
normalized, a pre-processing step that is required with this alternate approach is 
5 that the present invention needs to first normaUze the K*s before performing the 
conversions and scale back to de-normalize on the last pass. 

As described above, to compensate for negative kernel coefficients, the 
present invention negates the inputs based on the following: 

10 

neg B for LERP a = sign(kO) xor sign(kl) 
neg B for LERP b = sign(k2) xor sign(k3) 
neg B for LERP c = sign(k4) xor sign(k5) 
neg B for LERP d = sign(k6) xor sign(k7) 
1 5 neg B for LERP e = sign(kl) xor sign(k3) 
neg B for LERP f = sign(k5) xor sign(k7) 
neg B for LERP g = sign(k3) xor sign(k7) 
sign of scale factor = sign(k7) 



20 In addition, the texture filter (e.g., tri-linear interpolator 500) can also swap inputs 
A and B if necessary. For instance, if ko is negative (and ki positive), the A input 
must be negated. So, for the "a" LERP, the inputs are swapped and B negated. 



Referring still to Figures llA and IIB, the present invention uses look up 
25 tables 1150 to implement the global to local transformation. Texture parameters 
s and t are used to index the look up tables 1150 to obtain a set of NURBS control 
points (i.e., the set of NURBS control points A though P) appropriately indexed for 
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the local domain. These NURBS control points comprise a portion of an array of 
NURBS control points, which in turn, comprise the NURBS model. The NURBS 
control points A through P comprise a global to local transformed 4x4 array. 
Additionally, the look up tables yield the LERP weight parameters al through a8. 
5 The LERP parameters are utiHzed by interpolator 500 to evaluate the NURBS 
control points A through P. This causes tri-linear interpolator 500, as shown in 
Figure IIB, to implement convolution. 



10 interpolator 500 in two separate stages. In the first stage, NURBS control points 
A through H are fed through, using LERP weight parameters al through a7. .The 
output of interpolator 507 (e.g., the first resxilt) is stored in the register. In the 
second stage, NURBS control points I through P are fed through. This time, the 
output of interpolator 507 (e.g., the second result) is coupled to the first input of 

15 interpolator 1160 and the first result is coupled to the second input of interpolator 
1160. The interpolator 1160 combines the first and second results, using LERP 
weight parameter a8, to generate bk, a point on the surface. Thus, tri-linear 
interpolator 500 is utiHzed to implement a quadri-linear interpolator as shown in 
Figui^e IIB. Again, it should be noted that tri-linear interpolator 500, in the above 

20 manner, computes a convolution of the NURBS control points. 

Hence, the present invention generates a plurality of points on the surface. 
These points are fed back through the graphics rendering pipeline and are 
subsequently rendered into a resulting siarface. Thus, the present invention, in 
25 accordance with this alternate embodiment, directly renders a surface firom a 

NURBS model. As with the previous embodiment, the NURBS model is rendered 
using the graphics rendering pipeline (e.g., graphics rendering pipeline 301). 



The NURBS control points A through P are fed through tri-Hnear 
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With reference now to Figure 12A and Figure 12B, a diagram 1200 and a 
diagram 1201 are respectively shown. Diagram 1200 and diagram 1201, in 
accordance with the present invention, represent the processes for implementing 
5 automatic normal generation. In diagram 1200, block 1202 is the global to local 
domain transformation, block 1203 is the evaluation process of generating points 
on the curve, as described above. Blocks 1204 and 1205 are the Su and Sy partial 
generation processes. From these partials, the Su and sy tangents are derived in 
blocks 1206 and 1207. From these tangents, the normals to the surface are 

10 derived in block 1208. In diagram 1201, the points on the curve are determined, 
as described above, in block 1209. The Su partials are determined in block 1210. 
From these, the sv partial/normals are determined in block 1211. It should be 
noted that the su and sv partials can be computed in conjunction with either 
surface evaluation embodiment described above (e.g., evaluation via the NURBS 

15 to Bezier conversion process, or direct NURBS evaluation via the convolution 
process). 

Diagram 1200 generates analytically correct normals by computing the 
cross product of u-tangents and v tangents. The u-partials are defined by the 
20 equation: 

Equation 4 

Su = [xu yu Zu wj''' 
where Su and Sy are the surface partials. 
25 The partials may be computed using either method (i.e., either the NURBS to 
Bezier conversion or the Mansfield convolution). Equation 4 similarly defines Sy, 
the v-partials. The u-tangents are defined by the equation: 
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Equation 5 

Su =WSu-WuS 

5 The anal3^ic normal direction is thus: 

Equation 6 

n= Su X Sv 

10 Diagram 1201 shows an alternative to diagram 1200 and produces a 

reasonable approximation in less time. The difference between diagram 1200 and 
diagram 1201 is shown in equation 7 below: 

Equation 7 

15 "s^ X "s7 = w2(su X Sv) + w(s X (SuWu - SvWv)), SO if SuWu - SyWv = 0, then Su x Sy 

Su X Sv and n ~ Su x Sv 

In accordance with the present invention, surface partials are computed by 
the tri-hnear filter (e.g., tri -lineaj' filter 500). It should be noted that for surface 
20 partials, the surface partial of a point evaluated on a Bezier curve is equal to: 

Equation 8 

2uu - uu3 
2 

25 

Thus, equation 8 is computed by negating the second input to interpolator 507 in 
tri-linear interpolator 500 and using a LERP weight equal to 1/2. 
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Figure 13A and Figure 13B show tri-linear interpolator 500 implementing 
surface partial evaluation in accordance with the present invention. It should be 
noted that the inputs to tri-linear interpolator 500 are the same as they were 
5 during the Bezier evaluation shown in Figure 8A (which is similarly the same for 
the convolution process described above). The output of tri-linear interpolator 500 
is the resulting partial vector (e.g., Su). 

The present invention subsequently uses the graphics pipeline hardware, 
10 specifically, a "blender" included within raster subsystem 304. Since, equation 5 is 
very similar to operations already implemented in the hardware of the blender, the 
present invention utilizes the blender to compute tangents from partials. The 
present invention uses a two step process. In the first step, the partials are 
evaluated into the frame buffer of raster subsystem 304 as a vertex mesh. In the 
15 second step, the surface points are fed back through the rasterizer where they are 
blended with the partial mesh. The blending functions are arranged according to 
equation 9: 

Equation 9 
20 r =r3Ctd-rdag 

[x,y,z,w]'r is sent as (rs,gs,bs,ag) data and (rd,gd,bd,ocd) is equal to [xu,yu,Zu,Wu] 
By xising a subtract blending mode, the blender effectively computes: 
25 Equation 10 

XWu - XuW 
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Normals are also computed as cross products in the blender using a new blending 
factor referred to as a "cycUc shift". Cyclic shift source factors operate as follows: 

Equation 11 

r =rsgd-rdgs 
g =gsbd-gdbs 
b = bgrd - bdrg 
a=0 

The cross product desired is: 

Equation 12 

nx = YuZv - YvZu 

Ily — Z^Xy ■ ZyXy^ 

nz = XuYv - XvYu. 

Instead, what is actually computed is: 

Equation 13 
[nznxHy] 

Thus, in accordan ce with the process of the present invention: 
Equation 14 

[xuJuZuy]'^ is sent as (rg, gs, bg, ag) [xyyvZyx]'^ is arranged to be (rd, gd, bd, ocd) and 
thus, [nznxnyO]'^ is computed as desired. 

With reference now to Figure 14, flowchart the steps of a method 1400 in 
accordance with one embodiment of the present invention is shown. In step 1401, 
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a NUEBS model is received for rendering. In the present embodiment, the 
NURBS model is generated by a software program executing on the CPU 
subsystem. As described above, the NURBS model represents a 3D object. 

5 In step 1402, the process of the present invention converts the NURBS 

model into a Bezier model. This involves converting the individual NURBS curves 
into corresponding Bezier curves. The present invention utilizes the tri-linear 
interpolators included with the hardware of the graphics pipeline to implement the 
NURBS to Bezier conversion process. Each NURBS curve to be rendered is 
10 converted into equivalent Bezier curves as described above. This is accomplished 
by performing curve conversions in the u direction and then in the v direction. 

In step 1403, the process of the present invention generates points on the 
curve defined by the Bezier control points of the Bezier model. As described above, 
15 the Bezier control points are obtained via the NURBS to Bezier conversion 

process. The Bezier control points define a Bezier curve. The present invention 
uses the tri-linear interpolators to generate points on this cvirve. Again, this 
equation is performed in &cst the u direction and then in the v direction. 

20 In step 1404, the present invention comptites normals to the surface. As 

described above, the Su and sy partials are generated using the tri-linear 
interpolators. From these partials, the su and sv tangents are derived using the 
blender hardware. From these tangents, the blender hardware is used to generate 
the normals to the surface. The normals are subsequently used to realistically 

25 render lighting effects on the surface. 
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In step 1405, the present invention renders the sinface defined by the 
generated points and normals. As successive points are rendered, the graphics 
pipeline creates primitives (e.g., "quads") to render a surface. The curves in one 
direction (e.g., in the x direction) are rendered, the curves in another direction (e.g., 
5 the y direction) are rendered, creating a line drawing. Alternatively, the curves are 
coupled to form a quadrilateral mesh defining a surface. 

Thus, the present invention provides a method and a system which greatly 
increases the speed of the graphics rendering process. The method and system of 

10 the present invention increases the speed and efficiency of the process of 

rendering NURBS models. The method and system of the present invention 
accurately renders NURBS models without burdening the data transfer 
bandwidth of the computer system. In addition, the method and system of the 
present invention does not consimie an inordinate amount of host processor clock 

15 cycles. In so doing, the present invention provides a fast and efficient process for 
rendering NURBS models. 

Referring now to Figure 15, a computer system upon which the present 
invention may be practiced is shown as 1500. Computer system 1500 includes 

20 any computer controlled graphics systems for generating complex or 3 

dimensional images. Computer system 1500 includes a bus 1501 for transmitting 
digital information between the various parts of the computer system. One or 
more microprocessor(s) 1502 are coupled to bus 1501 for processing information. 
The microprocessor(s) 1502 comprise CPU subsystem 302 shown in Figure 3. 

25 The information along with the instructions of how the information is to be 

processed are stored in a hierarchical memory system comprised of mass storage 
device 1507, read only memory 1506, main memory 1504, and static random 
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access memory (SRAM) 1503. Mass storage device 1507 is used to store vast 
amounts of digital data. The mass storage device 1507 can consist one or more 
hard disk drives, floppy disk drives, optical disk drives, tape drives, CD ROM 
drives, or any number of other types of storage devices having media for storing 
5 data digitally. A read only memory (ROM) 1506 is used to store digital data of a 
permanent basis, such as instructions for the microprocessors. Main memory 
1504 is used for storing digital data on an intermediate basis. Main memory 1504 
can be dynamic random access memory (DRAM). 

10 The 3D graphics rendering pipeline 301 is also included in system 1500. As 

described above, processor(s) 1502 provides the graphics rendering pipeline 301 
with graphics data, such as drawing Commands, coordinate vertex data, and other 
data related to an object's geometric position, color, texture, shading, and other 
siirface parameters. The display subsystem 305 displays the rendered image on 

15 monitor 1521. 

Several other devices may also be coupled to system 1500. For example, 
an alphanumeric keyboard 1522 is used for inputting commands and other 
information to processor 1502. Another type of user input device is cursor control 

20 device 1523 (a mouse, trackball, joystick, and the Uke) used for positioning a 

movable cursor and selecting objects on a computer screen. Another device which 
may be coupled to bus 1501 is a hard copy device 1524 (e.g., a laser printer) for 
printing data or other information onto a tangible mediimi. Additionally, a sound 
recording or video option 1525 can be coupled to the system 1500 to provide 

25 multimedia capabiUties. 
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# • 

The present invention, a method and system for efficiently drawing NUEBS 
sxarfaces for 3D graphics, is thus disclosed. While the present invention has been 
described in particular embodiments, it should be appreciated that the present 
invention should not be construed as limited by such embodiments, but rather 
construed according to the below claims. 
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